Semi-algebraic Tools in Stochastic Games
نویسندگان
چکیده
In this thesis we consider two-person zero-sum stochastic games with a special focus on how tools from the mathematical field of semi-algebraic geometry have been applied to these games. In the first of two parts of the thesis we introduce stochastic games and prove a complexity result about computing the value of a type of stochastic games called concurrent reachability games. We show that the value of a finite-state concurrent reachability game can be approximated to arbitrary precision in TFNP[NP], that is, in the polynomial time hierarchy. Previously, no better bound than PSPACE was known for this problem. In the second part we go into the parts of semi-algebraic geometry theory that have been applied to stochastic games, and we provide a survey of the results in stochastic games that have been proved using these tools. We then go on to use semi-algebraic geometry to show that for limiting average games, the most general class of stochastic games we consider, there is a particularly nice class of stationary strategies that we will call monomial strategies which are -optimal among stationary strategies. This implies that all concurrent reachability games have a monomial family of -optimal strategies. We follow this up by proving numerically precise versions of some sampling theorems from semi-algebraic geometry. We apply these to the results of [40] to get precise bounds for the patience of concurrent reachability games, and obtain bounds for the monomial strategies that we have proven to exist.
منابع مشابه
Definable Zero-Sum Stochastic Games
Definable zero-sum stochastic games involve a finite number of states and action sets, reward and transition functions that are definable in an o-minimal structure. Prominent examples of such games are finite, semi-algebraic or globally subanalytic stochastic games. We prove that the Shapley operator of any definable stochastic game with separable transition and reward functions is definable in...
متن کاملLie Algebraic Methods in Optimal Control of Stochastic Systems with Exponential-of-integral Cost
The purpose of this paper is to formulate and study the optimal control of partially observed stochastic systems with exponential-of-integral-sample cost, known as risk-sensitive problems, using Lie algebraic tools. This leads to the introduction of the su cient statistic algebra, Ls, through which one can determine a priori the maximum order of the controller. When dim(Ls) < 1, the constructio...
متن کاملReal Algebraic Tools in Stochastic Games
In game theory and in particular in the theory of stochastic games, we encounter systems of polynomial equalities and inequalities. We start with a few examples. The first example relates to the minmax value and optimal strategies of a two-person zero-sum game with finitely many strategies. Consider a twoperson zero-sum game represented by a k ×m matrix A = (aij), 1 ≤ i ≤ k and 1 ≤ j ≤ m. The n...
متن کاملStochastic linear programming games with concave preferences
We study stochastic linear programming games: a class of stochastic cooperative games whose payoffs under any realization of uncertainty are determined by a specially structured linear program. These games can model a variety of settings, including inventory centralization and cooperative network fortification. We focus on the core of these games under an allocation scheme that determines how p...
متن کاملSemi-algebraic sets and equilibria of binary games
Any nonempty, compact, semi-algebraic set in [0, 1]n is the projection of the set of mixed equilibria of a finite game with 2 actions per player on its first n coordinates. A similar result follows for sets of equilibrium payoffs. The proofs are constructive and elementary.
متن کامل